WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/12, 15/13, 15/62, 15/79, 9/72, 
9/62, C07K 14/705 



Al 



(11.) International Publication Number: WO 99/53059 

(43) International Publication Date: 2! October 1999 (21.10.99) 



(21) International Application Number: PCT/US99/08 1 10 

(22) International Filing Date: 14 April 1999 (14.04.99) 



(30) Priority Data: 

60/082,002 
60/123,522 



16 April 1998 (16.04.98) US 
8 March 1999 (08.03.99) US 



{71) Applicant: GENENTECH, INC. [US/US]; 1 DNA Way, South 
San Francisco, CA 94080-4990 (US). 

(72) Inventors: ASHKENAZI. Avi, J.; 1456 Terrvtown Street. 
San Mateo, CA 94402 (US). BERMAN, Phillip, W.; 
95 Cheyenne Point, Portola Valley, CA 94028 (US). 
BROUSSEAU, David; 1274 43rd Avenue, San Francisco, 
CA 94127 (US). ETCHEVERRY, Tina; 1471 Greenwood 
Ten-ace, Berkeley, CA 94708 (US), 

(74) Agents: KUBINEC, Jeffrey, S. et al.; Genentech, Inc., 1 DNA 
Way. South San Francisco, CA 94080-4990 (US). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB. BG, 
BR, BY, CA, CH, CN, CU, CZ, DE, DtC, EE, ES, FI. GB, 
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS. JP. KE, KG, 
KP. KR, KZ, -LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, 
SK, SL, TJ, TM, TR, TT, UA, UG, UZ, VN, YU, ZA. ZW, 
ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, UG. 
ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, 
TM), European patent (AT, BE, CH, CY, DE, DK, ES. FI, 
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent 
(BF, BJ, CF, CG, CI, CM. GA, GN, GW, ML. MR, NE, 
SN, TD. TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: SECRETION OF GLYCOSYLATED PROTEINS USING A TISSUE PLASMINOGEN ACTIVATOR PRO-SEQUENCE 
(57) Abstract 



DNA constructs, host cells and production methods are disclosed for the expression and recovery of polypeptides, especially those 
altered to have one or more glycosylation sites added or deleted. The DNA constructs, host ceils and methods provided herein employ a 
DNA segment corresponding to a mammalian tissue plaminogcn activator signal and/or pro-peptide. 



BA 



FOR THE PURPOSES OF IN FORMA TION ONLY 



Codes used to identity States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


KS 


Spain 


I..S 


I .csotho 


SI 


Slovenia 


AM 


Armenia 


1*1 


Hin land 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


KR 


France 


l,U 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AX 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia arid Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugos lav- 


TM 


Turkmenistan 


HI" 


Burkina Kaso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


IIU 


Hungary 


ML 


Mali 


n 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


LA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MVV 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CK 


Central African Republic 


JF 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NI- 


Netherlands 


Yli 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cole d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


KO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian i-'ederation 






DE 


Germany 


I J 


Liechtenstein 


SI) 


Sudan 






OK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






KK 


Estonia 


LR 


Liberia 


SG 


Singapore 




t 



WO 99/53059 PCT/US99/081 1 0 

SECRETION OF GLYCOSYLATED PROTEINS USING A TISSUE PLASMINOGEN 

ACTIVATOR PRO-SEQUENCE 

Background of the Invention 

Ticiii Oi* mic Iiivciuiuri 

The present invention relates generally to the field of protein production by celi culture. In particular 
aspects, the invention provides methods and compositions for the production of polypeptides, especially by 
eukaryotic ceil culture, which enhance secretion and facilitate recovery of the polypeptides. 
Description of Related Disclosures 

Many eukaryotic ceil surface- and secreted proteins are post-rransiationaily processed to incorporate 
N-linked and O-linked carbohydrate (Komfeld and Komfeld (1985) Annu. Rev. Biochem. 54:631-64; 
Rademacher et aL (1988) Annu. Rev. Biochem. 57:785-838). Protein glycosyiation is thought to subserve 
a variety of functions including: augmentation of protein folding, inhibition of protein aggregation, regulation 
of intracellular trafficking to organelles, increasing resistance to proteolysis, modulation of protein 
antigenicity, and mediation of intercellular adhesion (Fieldler and Simons ( 1 995) Cell 8 1 :309-3 1 2: Helenius 
(1994) Mol. Biol, of the Cell 5:253-265: Olden etaL, (1978) Cell. 13:461-473: Caton etaL. (1982) Cell 
37:417-427: Alexander and Eider (1984) Science 226:1328-1330: Flack et aL (1994 J. Biol. Chem. 
269:14015-14020). In higher organisms, the nature and extent of glycosyiation can markedly affect the 
circulating half-life and bio-avaiiability of secreted proteins by mechanisms involving receptor mediated 
uptake and clearance (Ashwell and Morreil (1974) Adv. Enzymol. 41 :99-128: Ashwell and Harford (1982) 
Ann. Rev. Biochem. 5 1 :53 1-54). Receptor systems have been identified that are thought to play a major role 
in the clearance of serum proteins through recognition of various carbohydrate structures on the glycoproteins 
(Stockert(1995) Physiol. Rev. 75:591-609: Kery etaL. (1992) Arch. Biochem. Biophys. 298:49-55). Thus, 
production strategies resulting in incomplete attachment of terminal sialic acid residues might provide a means 
of shortening the bioavailability and half-life of secreted glycoproteins by promoting clearance by the hepatic 
asiaioglycoprotein receptor. Conversely, expression strategies resulting in saturation of terminal sialic acid 
attachment sites might lengthen protein bioavailability and half-life by preventing uptake by hepatic receptors. 

In the development of recombinant glycoproteins for use as pharmaceutical products, it has been 
speculated that the pharmacodynamics of recombinant proteins can be modulated by the addition or deletion 
of glycosyiation sites from a glycoprotein's primary structure (Berman and Lasky (1985a) Trends in 
Biotechnol. 3:5 1-53). However, studies have reported that the deletion of N-linked glycosyiation sites often 
impairs intracellular transport and results in the intracellular accumulation of glycosyiation site variants 
(Machamer and Rose (1988) J. Biol. Chem. 263:5955-5960: Gallagher et ai % (1992) J. Virology 66:7136- 
7145; Collier et a/.., (1993) Biochemistry 32: 7818-7823; Claffey et aL. (1995) Biochemica et Biophysica 
Acta 1246 :l-9; Dube era/.., (1988) J. Biol. Chem. 263:17516-17521). While glycosyiation site variants of 
secreted proteins can be expressed intracellularly. it has proved difficult to recover useful quantities from 
growth conditioned cell culture medium. 

It has been shown that impaired secretion of the HIV-1 envelope glycoprotein containing N-linked 
carbohydrate structures can be partially overcome by replacing the native signal sequence of the glycoprotein 
with the signal sequence and 5' untranslated region of the herpes simplex virus type I glycoprotein D (HSV 

-1- 



SUBST1TUTE SHEET (RULE 26) 



WO 99/53059 



PCI7US99/08110 



gD-l)(Lasky etai... (1986) Science 233:209-212: Berman et aU (1989) J. Virol. 63:3489-3498). Studies 
have reported the use of the native secretory peptide of human tissue type plasminogen activator in cell culture 
proauction or non-t-PA glycoproteins (International Publication No. WO 96/17067: Krasney and Young 
( 1 992) Cytokine 4:134-143; Rhodes etal..*(\ 994) J . Gen. Virol. 75:207-213). None have reoorted a strategy 
for the secretion and recovery of proteins wherein one or more native giycosyiation sites have been added to 
or deleted from the protein's primary structure. 

Summary of the Invention 

The present invention provides methods and compositions useful in the production of proteins by 
recombinant cell culture. The methods and compositions overcome intracellular retention of proteins and 
facilitate extracellular recovery of the produced proteins. In particular embodiments, the invention provides 
methods and compositions for the production of polypeptides other than tissue-type plasminogen activators 
(t-PA's) utilizing a precursor peptide corresponding to the ammo terminal signal and/or pro peptides naturallv 
associated with a mammalian t-PA which act to direct the secretion of the mature t-PA. In particular 
embodiments, the invention provides for the export and secretion of polypeptides other than t-PA's. For 
example, the invention provides methods and compositions useful in the recombinant production and 
extracellular recovery of novel chimeric or fusion proteins such as immunoadhesins and especially those with 
poor secretion kinetics. In particular embodiments the invention provides for the production and recovery of 
glycoproteins that have been altered from their native sequence to add or remove one or more giycosyiation 
sites. Advantageously, the invention improves and in preferred embodiments restores the export and secretion 
of such giycosyiation site variants. It is a further advantage of the present invention that greater quantities of 
the polypeptides produced utilizing the compositions and methods of the present invention can be recovered 
from the extracelluar medium of the cultured ceils from which they are produced. 

Accordingly, in particular embodiments, the invention provides nucleic acids, expression systems 
and host ceils, as well as methods for the production of non t-PA polypeptides, such as immunoadhesins and 
especially giycosyiation site variant polypeptides, in cell culture. In one embodiment of the present invention, 
a DNA construct is provided for the export and secretion of such polypeptides comprising a first DNA 
segment encoding a precursor peptide corresponding to ail or a portion of the ammo terminal signal and/or 
pro peptides naturally associated with a mammalian and preferably a human t-PA. The first DNA segment 
is operably linked, in frame, to a second DNA segment encoding the heterologous i non-t-PA) polypeptide. 

In particular aspects, the first DNA segment encodes a peptide corresponding to a mammalian and 
preferably a human t-PA pro-sequence as defined herein. According to this aspect of the present invention, 
the first DNA segment is operably linked to a second DNA segment encoding a heterologous polypeptide. 

Certain embodiments additionally comprise a DNA segment encoding a peptide corresponding to a 
mammalian signal and or pro-peptide upstream of and operably linked to the first DNA segment. For 
example, when the heterologous polypeptide is an immunoadhesin. such as a TNFR-IgG chimera, the first 
DNA segment may be preceded by a DNA segment encoding a mammalian t-PA or. alternatively, a 
heterologous signal and or pro-sequence. According to this aspect of the invention the mammalian t-PA signal 
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sequence or other heterologous signal and or pro-sequence and the pro-peptide amino acid sequence comprise 
the precursor peptide of the invention, 

In preferred embodiments, the first DNA se&menr encnHei a narivp nr nanir^Ny occurring signai and 
pro peptide of a mammalian t-PA and especially a human t-PA. The second DNA sequence preferably 

5 encodes a polypeptide other than a mammalian t-PA such as. for example, an immunoadhesin. According to 
further aspects of the invention, the second DNA sequence encodes a naturally occurring or chimeric 
polypeptide other than a mammalian t-PA wherein one or more giycosylation sites have been added to or 
deleted from the polypeptide's native sequence. The invention runner provides a DNA construct comprising 
one or more additional DNA segments operabiy linked to the first and second DNA segments. 

0 The invention further provides a cultured eukaryotic host cell comprising a DNA construct having 

a first DNA segment encoding a precursor peptide the precursor peptide corresponding to all or a portion of 
a mammalian t-PA signal and or pro peptide and a second DNA segment operabiy linked to the first DNA 
sequence, the second DNA sequence encoding a polypeptide other than a mammalian t-PA and preferably a 
polypeptide other than a mammalian t-PA wherein one or more giycosylation sites have been added to or 

5 deleted from the polypeptide. 

The invention further provides methods of producing a heterologous polypeptide especially a 
polypeptide which has been altered to delete one or more native N-linked giycosylation sites comprising the 
steps of 

(a) cuituring a eukaryotic host ceil comprising a DNA construct, the DNA construct comprising: 

0 a first DNA segment encoding a precursor peptide corresponding to ail or a portion of a mammalian 

tissue plasminogen activator signal and or pro peptide: and 

a second DNA segment operabiy linked to the tlrst DNA sequence, the second DNA sequence 
encoding a polypeptide other than t-PA wherein one or more native N-linked giycosylation sites of the 
polypeptide have been deleted from the polypeptide wherein the eukaryotic host cell expresses the first and 
second DNA segments and the polypeptide is secreted from the ceil: and 

(b) recovering the polypeptide so produced. 
Brief Description of the Drawings 

Figure 1 . Diagram of a tumor necrosis factor immunoglobulin chimeric molecuie (TNFR-IgG 1 ) and 
signal sequences. TNFR-IgG 1 is a chimeric protein consisting of the extracellular domain of the p55 TNF 
receptor fused to the hinge and Fc domain of an immunoglobulin heavy chain. TNFR-IgG 1 is secreted as a 
homodimer with four N-linked giycosylation sites (squares) per monomer. The proteins expressed in this 
study were synthesized using the wild type TNFR signal sequence containing 29 amino acids (SEQ ID NO: 
2), or a combination of the TNFR signal sequence and the signal and/or pro-sequence of human tissue 
plasminogen activator (tPA) (SEQ ID NO: I). 

Figures 2A-2D. Effect of the signal sequence on the secretion kinetics of TNFR-IgG 1 and 
tPA.TNFR-IgG 1 . Parallel cultures of human embryonic kidney cells were transfected with plasmids encoding 
either TNFR-IgG 1 utilizing the native signal sequence or the signal and pro-sequence of t-PA (tPA.TNFR- 
IgGl) for pulse labeling experiments. Two days post-transfection the culture, medium was removed and 
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replaced (after washing two times) with methionine- and cysteine-rree DMEM supplemented with ( 35 SV 
labeied methionine and cysteine. Cells were labeled at 37°C (5% C0 2 ) for the times indicated. The labeling 
reaction was terminated by washing the cells immediately in chilled (4°C) PBS followed by dererpenr 
extraction. Cell lysates and cell culture supernatants were precipitated by the addition of Protein A 
SEPHAROSE™ The Protein A:TNFR-IgG 1 complexes were pelleted by centrifugation. washed repeatedly 
and eluted in SDS-PAGE sample buffer containing mercaptoethanol. The eluted protein was resolved on 1 0% 
SDS-PAGE gels and visualized by autoradiography. Figure 2A. intracellular (i.e. ceil associated) TNFR- 
IgGl: Figure 2B. Secreted TNFR-IgGl; Figure 2C, intracellular tPA.TNFR.IgG; Figure 2D. secreted 
tPA.TNFR.IgG. 

Figures 3A-3F. Secretion kinetics of TNFR-IgGl signal sequence and glycosylation mutants. 
Plasmids encoding fully glycosylated (Figure 3A) or mutated forms of TNFR-IgG 1 (Figures 3B-3F) were 
transfected into 293 ceils and analyzed by pulse-chase analysis as described in Figure 2. Autoradiographs 
were analyzed with a scanning densitometer and the optical density values for supernatants ( o) and cell lysates 
( } were normalized and plotted as a function of time (0-24 hr). The glycosylation site mutants are identified 
using the single letter amino acid code with reference to the amino acid residue present at each of the four re- 
linked glycosylation sites (amino acid positions 14, 105. 111. and 248). Thus, the designation NNNN 
represents the wild type TNFR-IgGl. whereas NQNN indicates TNFR-IgGl where giutamine replaced 
asparagme at the second glycosylation site (position 105). Figure 3A. TNFR-IgGl; Figure 3B. tPA.TNFR- 
IgGl; Figure 3C TNFR-IgG I mutant lacking 2 native glycosylation sites (NNQQ); Figure 3D. tPA.NNQQ; 
Figure 3 E,QSNQ; Figure 3F, tPA.QSNQ. 

Figures 4A-4C. Immunoprecipitation of TNFR-IgGl glycosylation mutants. Plasmids encoding 
either TNFR-IgGl or TNFR-IgG glycosylation site mutants were transfected into 293 cells using calcium 
phosphate precipitated DNA. Two days post-trans fection the culture, medium was removed and replaced 
(after washing two times) with methionine- and cysteine-free DMEM supplemented with ( j5 S)-labeied 
methionine and cysteine. Cells were labeled overnight at 37°C (5% COt). Cell lysates and cell culture 
supernatants were precipitated by the addition of Protein A SEPHAROSE™. The Protein A:TNFR-IgG 1 
complexes were pelleted by centrifugation, washed repeatedly and eluted in SDS-PAGE sample buffer 
containing mercaptoethanol. The eluted protein was resolved on 10% SDS-PAGE gels and visualized by 
autoradiography. The glycosylation site mutants were examined in four separate experiments (Figures 4 A-4D) 
where transiently expressed TNFR-IgGl (TNFR-IgGl) expressed in 293 cells or a stable CHO cell line 
expressing TNFR-IgG (TRY+) were used as positive controls. Background binding was determined in 
experiments with ceils transfected with thrombopoeitin (TPO-). The mobilities of molecular weight markers 
are indicated at the left margins. 

Figures 5 A and 5B. Effect of signal sequence replacement on the secretion of TNFR-IgGl 
glycosylation site mutants. Plasmids encoding TNFR-IgGl glycosylation site mutants containing the tPA 
signal/pro sequence (tPA) (SEQ ID NO: I) or the TNFR signal sequence (TNFR) (SEQ ID NO: 2) were 
transfected into 293 cells, metabolically labeled with ( 35 S) methionine and cysteine, and immunoprecipitated 
from ceil culture supernatants (S) or ceil lysates (L) by the addition of Protein A SEPHAROSE™ as described 
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in Figure 4. The Protein A:TNFR-lgGl complexes were pelleted by centnfugation. resolved by SDS-PAGE. 
and visualized by autoradiography. The mobilities of molecular weight markers are indicated at the left 
margins. 

Figure 6. Characterization of TNFR-IgGl glycosylation mutants purified for receptor binding 
5 studies. TNFR-IgG 1 glycosylation site mutants were purified by protein A affinity chromatography from 
growth conditioned cell culture supernatants of stabile transfected CHO cells. The purified proteins were 
treated with SDS-PAGE sample buffer with- (lanes 8- 1 2) or without ( lanes 3-6) added 2-mercaptoethanol and 
resolved by SDS-PAGE. The resulting gel was stained with Coomassi blue. Each lane contained 3 ug of 
purified protein. Lanes 3 and 8 contained fully glycosylated TNFR-IgG 1 : lanes 4 and 9 contained the NNQQ 
10 glycosylation site mutant; lanes 5 and 10 contained the NSNQ glycosylation site mutant; lanes 6 and 1 1 
contained the QSNQ glycosylation site mutant. Lane 1 contained molecular weight standards (indicated at 
left margin), lanes 2.7. and 12 contained sample buffer alone. 

Figures 7A-7D. Competitive binding of ( l25 I)-Iabeied TNF to TNFR-IgG 1 glycosylation mutants. 
Purified TNFR-IgG 1 and TNFR-IgG 1 glycosylation site mutants were captured onto micotiter plates coated 

1 5 with affinity purified goat antibodies to human IgG Fc domain. The captured receptor chimeras were reacted 

1 "*5 

with ( - D-Iabeled TNF along with varying concentrations of unlabeled TNF. The binding of TNF to fully 
glycosylated TNFR-IgG 1 is shown in Figure 7 A; binding to the NNQQ glycosylation mutant is shown in 
Figure 7B; binding to the NSNQ glycosylation mutant is shown in Figure 7C; and binding to the QSNQ 
glycosylation mutant is shown in Figure 7D. 

20 Figure 8A-C. Secretion kinetics of TNFR-IgG 1 signal sequence variants. Plasmids encoding 

TNFRI-IgG 1 utilizing the signal sequence of TNFRI (SEQ ID NO: 2)(Figure 8A). the signal-pro sequence 
of tPA (SEQ ID NO: l)(Figure 8B)orthe signal sequence of TNFRI (SEQ ID NO: 2) and a pro-sequence 
(SEQ ID NO: 7) of human t-PA (SEQ ID NO: 8) (Figure 8C) were transfected into CHO cells and analyzed 
by pulse-chase analysis as described in Figure 2. Autoradiographs were analyzed with a scanning 

25 densitometer and the optical density values for supernatants (u) and cell lysates << ) were normalized and 
plotted as a function of time (0-24 hr ). 

Detailed Description of the Preferred Embodiments 

Definitions 

The terms "DNA sequence encoding", M DNA encoding" and "nucleic acid encoding" refer to the 
30 order or sequence of deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these 
deoxyribonucleotides determines the order of amino acids along the polypeptide chain. The DNA sequence 
thus codes for a particular amino acid sequence of a polypeptide. 

The terms "t-PA" and "tissue plasminogen activator" refer to a naruraiiy occurring extrinsic (tissue- 
type) plasminogen activator having fibrinolytic activity and typically having a structure with five domains 
35 (finger, growth factor, kringle-1. kringle-2, and protease domains). Naturally occurring mammalian t-PA 
includes human species as well as other animal species such as rabbit, rat, porcine, non human primate, 
equine, murine, and ovine t-PA. Nucleic acids encoding t-PA from human and non-human species is known 
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in the art. For example, human t-PA is encoded by the cDNA sequence reported in U.S. Patent Number 
4 .766.075. issued 23 August 1988. 

A "precursor peptide" as used in the context of the present invention and as more fully described 
herein below, is used to refer to a polypeptide having an amino acid sequence corresponding to all or a portion 
5 of a naturally occurring mammalian t-PA signal and/or pro peptide which participates in the secretion of t-PA 
under native conditions. 

"Pre-pro" or "signal-pro" peptide as used in the context of the present invention is meant to refer to 
an amino acid sequence such as that naturally associated with a mammalian t-PA which acts to direct the 
secretion of a mature polypeptide, for example, a mammalian t-PA. from a cell. As used herein, the term 

10 "signal-pro peptide" includes the "pre-" or "signal" sequence such as that naturally associated with a 
mammalian t-PA which functions to bind to the signal-recognition particle and direct the protein to the lumen 
of the endoplasmic reticulum (ER). A "signal" sequence is an amino acid sequence, characteristically 
hydrophobic in nature, cleaved by signal peptidases in the ER. For example, the signal sequence of t-PA is 
generally removed from the nascent t-PA co-rranslationally. In addition to a signal sequence, some 

1 5 mammalian proteins are associated post-translationally with a "pro" sequence such as the signal-pro sequence 
of a mammalian t-PA. The "pro-" sequence serves to target the nacsent polypeptide, for example a mature 
t-PA, to the Golgi apparatus (GA) and is cleaved post-transiationally. The pro-sequence is characteristically 
positively charged and commonly contains golgi peptidase cleavage sequences. 

The amino acid sequences of the mammalian t-PA signal-pro peptides are generally known or 

20 obtainable through conventional techniques. Signal-pro peptides from human and non-human t-PAs are 
known in the art and have been disclosed in. for example U.S. Patent 4.766.075: Rickles etai, (1988) J. Biol. 
Chem. 263:1563-1560 and Feng et a... (1990) J. Biol. Chem. 265:2022-2027. The term native signal-pro 
peptide specifically encompasses naturally-occurring pre- and prepro- sequences as defined and includes 
naturally-occurring variant forms (e.g., alternatively spliced forms) and naturally-occurring allelic variants 

25 of the t-PA signal-pro peptide. DNA sequences encoding mammalian t-PA amino terminal signal-pro peptides 
can be cloned as cDNA or genomic molecules according to techniques that are standard in the art or can be 
synthesized, preferably using automated equipment and the application of conventional synthetic protocols. 

As used herein, "heterologous polypeptide" refers generally to polypeptides and proteins regardless 
of their origin other than a t-PA or a variant thereof and generally having more than about ten amino acids 

30 which may or may not have one or more native or synthetic sites for the attachment of a carbohydrate. 
Preferably, heterologous polypeptides are "heterologous glycoproteins", that is a heterologous polypeptide as 
described above having one or more sites for the attachment of a carbohydrate in their native sequence. 
Examples of heterologous glycoproteins include molecules such as cytokines and their receptors, as well as 
chimeric proteins comprising cytokines or their receptors, including, for instance tumor necrosis factor alpha 

35 and beta, their receptors (TNFR-1; Gray et a/... (1990) Proc. Natl. Acad. Sci. USA 87:7380-7384: and TNFR- 
2: Kohno et ai., (1990) Proc. Natl. Acad. Sci. USA 87:8331-8335) and their derivatives: renin: a growth 
hormone, including human growth hormone, and bovine growth hormone; growth hormone releasing factor: 
parathyroid hormone: thyroid stimulating hormone: lipoproteins; alpha- 1 -antitrypsindnsulin A-chain: insulin 
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B-chain: proinsulin: follicle stimulating hormone: calcitonin; luteinizing hormone; glucagon; dotting factors 
such as factor VHIC factor (X. tissue factor, and von Willebrands factor; anti-clotting factors such as Protein 
C: atrial natriuretic factor: lung surfactant; a plasminogen activator other rhan a ticcn^.n^- ^Ur^;^ 

" r - r* — — 

activator (t-PA), for example a urokinase: bombesin: thrombin: hemopoietic growth factor: enkephalinase; 
5 RANTES (regulated on activation normally T-cell expressed and secreted); human macrorhrg? inflammatory 
protein ( MIP- 1 -alpha):a serum albumin such as human serum albumin: mullerian-inhibiting substance: relaxin 
A-chain: relaxin B-chain; proreiaxin; mouse gonadotropin-associated peptide: a microbial protein, such as 
beta-lactamase: DNase; inhibin; activin; vascular endothelial growth factor (VEGF): receptors for hormones 
or growth factors: integrin; protein A or D: rheumatoid factors; a neurotrophic factor such as bone-derived 
1 0 neurotrophic factor (BDNF), neurotrophin-3. -4, -5, or -6 (NT-3. NT-4. NT-5. or NT-6). or a nerve growth 
factor such as NGF-p: platelet-derived growth factor (PDGF): fibroblast growth factor such as aFGF and 
bFGF: epidermal growth factor ( EGF); transforming growth factor ( TGF) such as TGF-aipha and TGF-beta, 
including TGF-pl, TGF-P2, TGF-P3. TGF-P4. or TGF-P5; insulin-like growth factor-I and -II (IGF-I and 
IGF-II); des( 1 -3)-IGF-l (brain IGF-I), insulin-like growth factor binding proteins: CD proteins such as CD-3. 
1 5 CD-4. CD-8. and CD- 19; erythropoietin: osteoinductive factors: immunotoxins: a bone morphogenetic protein 
(BMP); an interferon such as interferon-alpha, -beta, and -gamma: colony stimulating factors ( CSFs), e.g. , M- 
CSF. GM-CSF. and G-CSF; interieukins (ILs), e.g., IL-1 to IL- 10; superoxide dismutase; T-cell receptors: 
surface membrane proteins: decay accelerating factor: viral antigen such as, for example, a portion of the HI V- 
1 envelope glycoprotein. gpl20. gpl60 or fragments thereof; transport proteins: homing receptors: addressins; 
20 regulatory proteins: antibodies and chimeric proteins, such as immunoadhesins. 

Preferably the heterologous glycoproteins are "glycosylation site variants" of any of the heterologous 
glycoproteins described above. More particularly, the glycosylation site variants according to the present 
inventionare naturally occurring heterologous glycoproteins or variations thereof such as the chimeric proteins 
described above, having at least one site for glycosylation. preferably an N-linked glycosylation site, that has 
25 been added to or deleted from their native sequence. Such glycosylation site variants include both 
"glycosylation site addition variants" as well as "glycosylation site deletion variants" so named herein to 
delineate heterologous glycoproteins as defined above having one or more sites for glycosylation. preferably 
N-linked. added to or deleted from their native sequence, respectively 

The heterologous glycoproteins of the present invention have one or more carbohydrate structures 
30 that occur on the native protein expressed as N-linked or O-linked carbohydrates. The N-linked and O-linked 
carbohydrates differ primarily in their core structures. N-linked glycosylation refers to the attachment of the 
carbohydrate moiety via N-acetylglucosamine (GlcNAc) to an asparagine residue in the peptide chain. The 
amino acid sequence of the heterologous polypeptide will contain an asparagine-X-serine. asparagine-X- 
threontne, or asparagine-X-cysteine. wherein X is any amino acid except proline. O-linked carbohydrates, 
35 by contrast are characterized by a common core structure, which is the N-acetylgalactosamine (GaiNAc) 
attached to the hydroxy! group of a threonine or serine in the native amino acid sequence of the heterologous 
polypeptide. 
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The terms "tumor necrosis factor receptor" and "TNFR" in the context of the present invention refer 
to a polypeptide comprising the amino acid sequence of a native TNF binding polypeptide, or any 
combination, derivative or fragment thereof which is capable of binding to tumor necrosis factnr-rc and/or .p 
(TNF-a and/or -P). This definition includes substantially intact ceil surface or soluble type I and type 2 TNFR 
5 polypeptides (TNFR 1 and TNFR2 ) from natural sources, synthetically produced in vitro or obtained by genetic 
manipulation including methods of recombinant DNA technology, as well as various chain combinations of 
such polypeptides. Native TNFR include human and non-human animal, e.g. murine, bovine, equine, porcine, 
etc. TNFR polypeptides. The non-human mammalian species can. for example, be obtained by cross-species 
hybridization, using probes obtained from the human DNA sequence as hybridization probes to isolate TNFR 

10 cDNAs from the respective non-human mammalian cDNA libraries. 

The ch imeric gene encoding TNFR 1 -IgG 1 is described in Ashkenazi et aL. . ( 1 99 \) Proc. Natl. Acad. 
Sci. 88:10535-10539. TNFR 1 -IgG 1 is a chimeric protein constructed by fusing the extracellular domain of 
the receptor type 1 for TNF alpha with sequences encoding the Fc domain and hinge region of IgGl 
(Ashkenazi et a/... (1991). The chimeric protein (Figure 1 ) contains four N-linked glycosyiation sites: three 

15 located in the TNFR region (positions 14. 105. and 1 1 1) and one in the Fc domain (position 248). For 
simplicity, the glycosyiation mutants described in this invention are named using the single letter amino acid 
code with reference to the amino acid residue present at each of the four N-linked glycosyiation sites at amino 
acid positions 14. 105, ill, and 248 (Figure 1). Thus, the designation NNNN represents the fully 
glycosylated TNFRl-IgG 1, whereas NQNN indicates TNFR-IgGl where giutamine replaced asparagine at 

20 the second glycosyiation site (position 105). 

As used herein, the term "immunoadhesin" designates antibody-iike molecules which combine the 
"binding domain" of a heterologous protein (an "adhesin". e.g. a receptor, ligand or enzyme) with the effector 
functions of immunoglobulin constant domains. Structurally, the immunoadhesins comprise a fusion of the 
adhesin amino acid sequence with the desired binding specificity which is other than the antigen recognition 

25 and binding site (antigen combining site) of an antibody (i.e. is heterologous") and an immunoglobulin 
constant domain sequence. The immunoglobulin constant domain sequence in the immunoadhesin may be 
obtained from any immunoglobulin, such as IgG], IgG?, IgG3. or IgG 4 subtypes. IgA. IgE. IgD or IgM. 
Immunoadhesins are described in, for example. U.S. Patent Nos. 5.1 16.964. 5.714.147 and 5.336.603 the 
disclosures of which are hereby incorporated by reference. Immunoadhesins include CD4 (Capon et aL. 

30 (1989) Nature 337:525-531: Traunecker et aL, (1989) Nature 339:68-70: and Byrn et aL, (1990) Nature 
344:667-670): L-selectin or homing receptor (Watson et aL , ( 1990) J. Cell. Biol. I 1 0:222 1 -2229: and Watson 
etal.;(\99\) Nature 349:164-167); CD44 (Aruffo et aL. (1990) Cell 61:1303-13 13: CD28 and B7 (Linsley 
et aL. (1991) J. Exp. Med. 173:721-730); CTLA-4 (Lisley et aL, J. Exp. Med. 174:561-569); CD22 
(Stamenkovic et aL . Cell 66: 1 1 3 3- 1 1 44); TNF receptor (Ashkenazi et aL , ( 1 99 1 ) Proc. Natl. Acad. Sci. USA 

35 88:10535-10539: Lessiauer et aL (1991) Eur. J. Immunol. 27:2883-2886; and Peppel et aL (1991) J, Exp. 
Med. 174:1483-1489; Mohlerera/., (1993) J. Immunol. 15 1:1548-1 56 1);NP receptors (Bennetts a/., (1991) 
J. Biol. Chem. 266:23060-23067; interferon y receptor (Kurschner et aL, (1992) J. Biol. Chem. 267:9354- 
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9360: 4- IBB (Chalupny et at.., (1992) PNAS USA 89:10360-10364) and IgE receptor a (Ridgway and 
Gorman. (1991 ) J. Ceil. Biol. 1 15. Abstract No. 1448). 

The terms "host cell" and "host cell line" refer to cells and cell lines derived from a prokaryotic or 
eukaryotic organism that are capable of growth and survival when placed in either monolayer culture or in 
5 suspension culture in a medium containing the appropriate nutrients and growth factors. Suitable host cells 
for use within the present invention include any type of cell that can be engineered to express a DNA 
molecule, can be grown in culture, and have a secretory pathway. Although prokaryotic cells such as E. coli 
cells are capable of secreting protein at least into the periplasmic space, it is preferred within the context of 
the present invention to use cultured eukaryotic cells, such as fungal cells, insect cells, yeast cells or in 
10 particular mammaiian cells. The necessary growth factors for a particular cell line are readily determined 
empirically without undue experimentation, as described for example in Mammalian Cell Culture (Mather. 
J.P. ed.. Plenum Press. N.Y. [1984]). and Barnes and Sato, (1980) Cell. 22:649. Typically, the cells are 
capable of expressing and secreting large quantities of a particular glycoprotein of interest into the culture 
medium. Examples of such ceils include SF9 insect cells (Summers and Smith (1987) Texas Agriculture 

1 5 Experiment Station Bulletin, 1 555: and Insect Cell Culture Engineering, Goosen Daugulis and Faulkner Eds. 
Dekker. New York). Examples of suitable mammalian host cells within the context of the present invention 
may include Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin. Proc. Natl. Acad. Sci. USA. 
77:4216 [1980]): dpl2.CHO cells (EP 307,247 published 15 March 1989); monkey kidney CVI line 
transformed by S V40 (COS-7. ATCC CRL 1 65 1 ); human embryonic kidney line (293 or 293 cells subcloned 

20 for growth in suspension culture, Graham etai , J. Gen Virol., 36:59 [ 1 977]); baby hamster kidney cells (BHK. 
ATCCCCL 10); mouse Sertoli cells (TM4. Mather, Biol. Reprod.. 23:243-251 [1980]); monkey kidney cells 
(CVI ATCC CCL 70): African green monkey kidney cells (VERO-76, ATCC CRL- 1587): human cervical 
carcinoma cells (HELA. ATCC CCL 2); canine kidney cells (MDCK. ATCC CCL 34): buffalo rat liver cells 
(BRL 3A. ATCC CRL 1442): human iung cells (W138. ATCC CCL 75); human liver cells (Hep G2. HB 

25 8065): mouse mammary rumor < MMT 060562. ATCC CCL5 1 ); TRI cells (Mather et aL Annals N.Y. Acad. 
Sci., 383:44-68 [1982]); MRC 5 cells: FS4 cells; and a human hepatoma line (Hep G2). 

Preferred host cells include Chinese hamster ovary cells/-DHFR (CHO. Urlaub and Chasin. Proc. 
Natl. Acad. Sci. USA, 77:4216 [1980]); dpl2.CHO cells (EP 307.247 published 15 March 1989). 

"Expression vector" refers to a DNA construct containing a DNA sequence which is operably linked 

30 to a suitable control sequence capable of effecting the expression of the DNA in a suitable host. Such control 
sequences include a promoter to effect transcription, an optional operator sequence to control such 
transcription, optional RNA splice donor/acceptor sequences, a sequence encoding suitable mRNA ribosome 
binding sites, and sequences which control termination of transcription and translation. The vector may be 
a piasmid. a phage particle, a recombinant virus or simply a potential genomic insert. Once transformed into 

35 a suitable host, the vector may replicate and function independently of the host genome, or may, in some 
instances, integrate into the genome itself. In the present specification, "piasmid" and "vector" are sometimes 
used interchangeably as the piasmid is the most commonly used form of vector at present. However, the 
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invention is intended to include such other forms of expression vectors which serve equivalent Functions and 
which are. or become, known in the an. 

"Operably linked" when describing the relationship between two DNA regions simply means that 
they are functionally related to each other. For example, a signal sequence is operably linked to a peptide if 
it functions to ensure synthesis on membrane bound ribosomes and to translocate the nascent protein across 
the lumen of the endoplasmic reticulum as a signal sequence, participating in the secretion of the mature form 
of the protein most probably involving cleavage of the signal sequence. A promoter is operably linked to a 
coding sequence if it controls the transcription of the sequence: a ribosome binding site is operably linked to 
a coding sequence if it is positioned so as to permit translation. Therefore, nucleic acid is "operably linked" 
when it is placed into a functional relationship with another nucleic acid sequence. Generally, "operably 
linked" means that the DNA sequences being linked are contiguous, and. in the case of a secretory leader, 
contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is accomplished 
by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or 
linkers are used in accordance with conventional practice. 

The terms "transfected." "transformed (host) cell" . "transfectant" and "transformed" and the like refer 
to the introduction of DNA into a cell. The cell is termed a "host cell 1 '. The introduced DNA is usually in the 
form of a plasmid derived vector containing an inserted piece of DNA. The introduced DNA sequence may 
be from the same species as the host cell or a different species rrom the host cell, or it may be a hybrid DNA 
sequence, containing some foreign and some homologous DNA. The words transformants and transformed 
(host) cells include the primary subject ceil and cultures derived therefrom, without regard to the number of 
transfers. It is also understood that all progeny may not be precisely identical in DNA content, due to 
deliberate or inadvertent mutations or amplification of chromosomal fragments. Mutant progeny that have 
the same function or biological property as screened for in the originally transformed ceil are included. 

The term amino acid or amino acid residue, as used herein, refers to naturally occurring L amino 
acids or residues, unless otherwise specifically indicated. The commonly used one- and three-letter 
abbreviations for amino acids are use herein (Lehninger. A. L., Biochemistry. 2d ed.. pp. 71-92. Worth 
Publishers, N. Y. ( 1975)). 
Modes for Carrying out the Invention 

Studies have demonstrated that deletion of N-I inked glycosylation sites from a glycoprotein's primary 
structure often impairs the intracellular transport and secretion of the polypeptide. According to the present 
invention, the defect in secretion of such polypeptides can be overcome by operably linking the nucleic acid 
sequence encoding the polypeptide to a nucleic acid sequence encoding a precursor peptide. Surprisingly, the 
precursor polypeptide increased the secretion efficiency of the parent glycoprotein as well. The precursor 
peptide has an amino acid sequence which correspond to all or a portion of a naturally occurring mammalian 
t-PA signal-pro peptide which participates in the secretion of t-PA under native conditions. 

Preferably the precursor peptide of the present invention is "capable of restoring the export and 
secretion" of a heterologous polypeptide, especially a heterologous glycoprotein such as an immunoadhesin 
and more preferably a heterologous glycosylation site variant glycoprotein to the extracellular space. As will 
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be appreciated by the skilled artisan "capable of restoring the export and secretion" of a polypeptide to the 
extracellular space is a relative term. Thus the term when used to describe the bioioeical activity of the 
precursor peptide means a peptide that when linked to a heterologous polypeptide produces an increase in the 
amount or heterologous polypeptide that can be recovered from the extracellular space when compared to the 
amount recoverable from the extracellular space in the absence of the precursor peptide of the present 
invention and in the presence of the native signal and/or pro sequence of the heterologous polypeptide. 

To determine whether the precursor peptide improves the export and secretion of the polypeptide, 
rhe efficiency of intracellular transport can be measured by routine experimentation. For example, a plasmid 
can be constructed for transient transfection studies in eukaryotic host cells. Pulse-chase experiments similar 
to those described herein will show that replacement of the native signal sequence associated with the 
heterologous poKoeptide with the precursor peptide of the present invention results in an improvement in 
secretion efficiency. Thus, after transient expression at, for example 24 hours, the amount of heterologous 
polypeptide secreted into the culture medium can be measured and compared to a control using a native or 
other signal sequence. For example, approximately 70-80% and preferably 80-100% of the pulse-labeled 
heterologous polypeptide can be secreted into the cell culture medium when a precursor peptide of the present 
invention is employed in the expression construct, whereas only 50-60% of the heterologous polypeptide is 
secreted using the wild-type or native signal sequence. This represents a 20 to 50% increase in secretion 
efficiency. 

The increased efficiency of secretion achieved according to the present invention utilizing the 
precursor peptide is also apparent from the kinetics of secretion of the heterologous polypeptide. For example, 
when the native signal sequence is employed. 40% of the pulse-labeled heterologous polypeptide is secreted 
in 2-4 hrs. after pulse labeling. However when the native signal sequence is replaced with the precursor 
peptide corresponding to a mammalian t-PA signal-pro peptide, approximately 60% of the heterologous 
polypeptide is secreted in this time frame. 

The precursor peptide of the present invention has an ammo acid sequence which corresponds to all 
or a portion of the amino acid sequence of a mammalian t-PA signal-pro peptide including native signal-pro 
peptides of t-PA having the same amino acid sequence as a signal-pro peptide of a mammalian t-PA sequence 
derived from nature. Using human t-PA as but an example, the signal-pro peptide precedes the mature t-PA 
protein sequence and acts to direct export and secretion of the mature protein. The approximately 35 amino 
acid signal pro-peptide of human t-PA consists of a signal sequence of about 21 (SEQ ID NO: 3) or 22 (SEQ 
ID NO: 12) amino acid residues, also called a pre-sequence. and an 1 1 (SEQ ID NO: 4) or 10 (SEQ ID NO: 
13) amino acid pro-sequence, including a 3 amino acid exopeptidase cleavage site (see, for example. Figure 
I infra : Example 8, infra ; and Berg and Grinnell ( 1 99 1) Biochem. Biophys. Res. Commun. 1 79:3 1 289- 1 296). 
However, it is the ability of the precursor peptide to restore or increase export and secretion of the 
heterologous polypeptide and not the length of the precursor peptide that is important. Therefore, in one 
embodiment of the invention, the signal-pro peptide of a mammalian t-PA is a native human sequence 
comprising amino acids 1 to 35 of SEQ ID NO: 1 numbered as shown below and corresponding to amino acids 
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-35 to •! of Fig. I. Therefore, in general the present invention makes use of a precursor peptide having the 
sequence: 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15 
Met-Asp-Ala-Met-Lys-Arg-Gly-Leu-Cys-Cys-Val-Leu-Leu-Leu-Cys-Gly- 
5 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 
Ala-Val-Phe-Val-Ser-Pro-Ser-Gln-Glu-Ile-His-Ala-Arg-Phe-Arg-Arg- 

33 34 35 

Gly-Ala-Arg (SEQ ID NO: 1) . 

In another embodiment of the invention, the precursor peptide comprises a human t-PA pro-sequence 

10 as, for example, amino acids 22 to 32 (SEQ ID NO: 4) of SEQ ID NO: i. or amino acids 22-35 (SEQ ID NO: 
5) of SEQ ID NO: 1 and more preferably amino acid residues 23-35 of SEQ ID NO: I (SEQ ID NO: 7). In 
another embodiment of the invention, the precursor peptide is a human signai-pro sequence comprising amino 
acids I to 32 (SEQ ID NO: 6) of SEQ ID NO: I and preferably amino acid residues 1-35 (SEQ ID NO: I). 
Such precursor peptides and the DNA encoding them can be isolated from nature or can be produced by 

! 5 recombinant or synthetic means. 

In a further aspect of the present invention the precursor peptides of the present invention are 
homologous amino acid sequences of mammalian t-PA's signai-pro peptides or homologous amino acid 
sequences of the sequence of. for example, SEQ ID NO: I, SEQ ID NO: 3. SEQ ID NO: 4. SEQ ID NO: 5. 
SEQ ID NO: 6. and SEQ ID NO: 7 including homologous in vitro generated variants having the qualitative 

20 biological activity defined above. Homology with respect to the precursor peptides of the present invention 
is defined as the percentage of amino acid residues in a candidate sequence that are identical with either the 
amino acid residues in SEQ ID NO: 1 . SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5. SEQ ID NO: 6. SEQ ID 
NO: 7. the amino acid sequence of a mammalian t-PA or a composite sequence of a human and other 
mammalian sequence as defined herein after aligning the sequences and introducing gaps if necessary to 

25 achieve the maximum identity. No N- or C- terminal extensions or deletions in the candidate sequence shall 
be construed as reducing identity. For example. N- or C-terminai addition of a signai sequence other than that 
of a mammalian t-PA to mammalian precursor peptide as defined herein is within the scope of the precursor 
peptide of the invention. "Composite amino acid" within the present invention refers to an alternate amino 
acid having the same position in the 35 amino acid residue structure as human t-PA from another mammalian 

30 vertebrate species. Therefore, an amino acid substitution referred to as a composite amino acid substitution 
replaces the identified amino acid with the equivalent or composite amino acid from another mammalian 
species. A composite sequence is defined as having at least one amino acid from the native human t-PA 
sequence replaced with a composite amino acid from another mammalian species. 

Therefore, the invention contemplates an precursor peptide having at least the qualitative biological 

35 activity as defined above and having, for example, at least about 75% amino acid homology with the 
polypeptide of SEQ ID NO: 1. SEQ ID NO: 3, SEQ ID NO: 4 or the polypeptide of SEQ ID NO: 1 lacking 
the 3 carboxyl terminal amino acid residues. SEQ ID NO: 6 and/or the 2 1 amino terminal residues. SEQ ID 
NO: 5. The precursor peptide amino acid sequence preferably will share at least 80%. more preferably, greater 
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than 85% sequence homology with the sequence of SEQ ID NO: I or the sequences iacking the 3 carboxyl 
terminal amino acid residues (SEQ ID NO: 6) and/or the 21 amino terminal residues (SEQ ID NO: 5). 
However, a precursor peptide may exhibit less than 50% sequence homology with the sequence of SEQ ID 
NO: 1 and still retain the characteristics of the precursor peptide as defined above. 
5 Since the mechanism by which the signal-pro peptide of t-PA overcomes endoplasmic reticulum 

retention may relate to its complex structure, precursor peptides which comprise a portion of a signal-pro 
peptide as defined above are also within the scope of the present invention. For example, as noted above, the 
signal-pro peptide of t-PA is unusual, and contains an approximately 2 1 -22 amino acid pre- or signal sequence 
and a 10 or 11 amino acid pro-sequence, including a 3 amino acid exopeptidase cleavage site (Figure 1). 
10 Previous studies, (Berg and Grinneil (1991) Biochem. Biophys. Res. Commun. 1 79: 1289-1296) have shown 
that the pre- or signal sequence is cleaved co- trans iationally in the endoplasmic reticulum and that the pro- 
sequence is removed in the Golgi apparatus (GA ) by cleavage at a furin processing site. The fact that the endo 
H sensitive, intracellular forms of an exemplary heterologous glycoprotein glycosylation sitevariantexpressed 
with the amino terminal precursor peptide sequence of the present invention possessed higher molecular 

1 5 weight than the secreted products demonstrates that, like wild type t-PA. removal of the pro-sequence from 
glycosylation mutants occurs during export through the GA. Based on these observations, the amino terminal 
pro-sequence of t-PA represents a structural element able to promote ER to GA transport of the heterologous 
polypeptide, and especially glycosylation mutants. 

Therefore according to a preferred aspect of the present invention, a precursor peptide comprising 

20 ail or a portion of the pro-sequence of a mammalian-tPA is employed to effect the export and secretion of the 
heterologous polypeptides of the present invention. For example, the pre-sequence of any mammalian pre- 
protein sequence, for example, that associated with the native heterologous polypeptide, may be employed in 
conjunction with the pro-sequence of a mammalian t-PA in the precursor peptide of the present invention. 
For example, insertion of the 13 ammo acid tPA pro-peptide Ser-Gln-Glu-Ile-His-Ala-Arg-Phe-Arg-Arg-Giy- 

25 A la-Arg (SEQ ID NO: 7) between the TNFR 1-IgGl endogenous signal sequence, i.e., that naturally associated 
with TNFR 1 (Gray etai.AX 990) supra , ammo acids -40 to - 1 2 of Figure I. SEQ ID NO: 2) and the TNFR I - 
IgGl coding sequence provides a precursor peptide (SEQ ID NO: 8) within the context of the present 
invention that exhibits an increase in export and secretion of the mature molecule (TNFRl-lgGI). 

According to another aspect of the present invention the DNA sequence of the t-PA pro-sequence 

30 (SEQ ID NO: 5; SEQ ID NO: 7) is modified to introduce or eliminate proteolytic processing sites. Such 
modified precursor peptides are described in, for example. International publication No. WO 96/17067. 
Briefly, and while not wishing to be bound by theory it is believed that pro-sequence of mammalian t-PA is 
cleaved at a site dependent upon a prohormone converting enzyme such as the yeast KEX2 gene product or 
the mammalian enzymes PCI. PC2 and furin. Enzymes of this type recognize cleavage sites characterized 

35 by arginine residues in the - 1 and -4 positions. Cleavage is facilitated by a basic amino acid residue e.g. Lys 
or Arg in the - 1 position. Within the present invention therefore are a pro-sequence having an Arg at residue 
35 and 32 and optionally a basic amino acid residue at position 34. 
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As indicated, the present invention furthercomprisesa second DNA segment encoding a heterologous 
polypeptide, i.e.. a polypeptide other than a mammalian t-PA such as a TNFR immunoadhesin. Preferably 
the heterologous polypeptide lacks or has been modified to delete one or more native slycosyfation sites as 
described. 

Deletion of glycosylation sites from a heterologous polypeptide may be accomplished by altering the 
amino acid sequence of the polypeptide. The alteration may be made by, for example, the addition of or 
substitution by one or more amino acids to the polypeptide's native sequence. For example, if N-linked 
glycosylation is contemplated, the glycosylation site in the variant is a tripeptidyl sequence of the formula: 
asparagine-X-serine or asparagine-X-threonine. wherein asparagine is the acceptor and X is any of the twenty 
genetically encoded amino acids except proline. (See Struck, D.K. and Lennarz. W.J. in The Biochemistry 
of Glycoproteins and Proteoglycans . W.J. Lennarz ed.. Plenum Press. 1980, p. 35; Marshall. R.D. (1974) 
Biochem. Soc. Symp. 40:17: and Winzler. R.J. in Hormonal Proteins and Peptides. Li. C.I. ed.. Academic 
Press. New York. 1973, pp. 1-15). The amino acid sequence variants herein are modified by substituting the 
appropriate amino acidfs) at the proper site(s ) to effect a change in the native glycosylation site. For example 
an asparagine residue can be replaced by a giutamine residue to disrupt a native glycosylation site within the 
polypeptide. 

If O-linked glycosylation is to be employed. O-glycosidic linkage occurs in animal cells between N- 
acetylgaiactosamine. galactose or xylose, and one of several hydroxyamino acids, most commonly serine or 
threonine, but also in some cases a 5-hydroxyproiine or 5-hydroxylysine residue placed in the appropriate 
region of the molecule. 

According to the present invention the polypeptides are altered through changes at the DNA level, 
particularly by mutating the DNA encoding the polypeptide at preselected bases, i.e.. those encoding N- or 
O-linked glycosylation sites such that codons are generated that will translate into amino acids otherthan those 
that will typically be present for N- or O-linked glycosylation. The variations can be made using methods 
known in the an such as oligonucieotide-mediated (site directed) mutagenesis, alanine scanning and PCR 
mutagenesis. Site directed mutagenesis (Carter et at.. . ( 1 986) Nucl. Acids Res. 13:4331: Soiler et ai. .. ( 1 987) 
Nuci. Acd. Res.. 10:6487, cassette mutagenesis (Wells et al... (1985) Gene 34:315). restriction selection 
mutagenesis (Wells et ai.{\ 986) Philos. Trans. R. Soc. London ) or other known techniques can be performed 
on the cloned DNA to produce the glycosylation site mutants that are a subject of the present invention. 

Therefore, according to the present invention in order to direct the secretion of a heterologous 
polypeptide and preferably a glycosylation variant as described above, a DNA segment encoding a precursor 
peptide is joined to a DNA sequence encoding the heterologous polypeptide in the correct reading frame so 
that the joined sequences encode a fusion protein. 

Expression vectors for use in the methods and compositions of the present invention will generally 
include a promotor capable of directing the transcription of the first and second DNA of the invention. The 
term "control sequences" refers to DNA sequences necessary for the expression of an operably linked coding 
sequence in a particular host organism. The control sequences that are suitable for prokaryotes. for example, 
include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known 
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to utilize promoters, poiyadenylation signals, and enhancers. Preferred profoy^fsWlude viral promoters ana 
cellular promoters. 

Host ceils are transfected or transformed with expression or cloning vectors described herein for 
heterologous polypeptide production and cultured in conventional nutrient media modified as appropriate for 
inducing promoters, selecting trans formants, or amplifying the genes encoding the desired sequences. The 
culture conditions, such as media, temperature. pH and the like, can be selected by the skilled artisan without 
undue experimentation. In general, principles, protocols, and practical techniques for maximizing the 
productivity of cell cultures can be found in Mammalian Cell Biotechnology: a Practical Approach . M. Butler, 
ed. (IRL Press. 1 99 U and Sam brook et a/... supra. 



Methods of trans fection are known to the ordinarily skilled artisan, for example. CaP0 4 and 
electroporation. as well as the use of commercially available cationic lipid reagents which facilitate 
transfection. Depending on the host cell used, transformation is performed using standard techniques 
appropriate to such cells. The calcium treatment employing caicium chloride, as described in Sambrook et 
ai.. supra , or eiectroporation is generai.lv used for prokaryotes or other cells that contain substantial cell-wall 
barriers. Infection with Agrobaaerium tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et a/... Gene. 23:3 15 (1983) and WO 89/05859 published 29 June 1989. For mammalian 
cells without such cell wails, the calcium phosphate precipitation method of Graham and van der Eb, Virology. 
52:456-457 (1978) can be employed. General aspects of mammalian cell host system transformations have 
been described in U.S. Patent No. 4,399.216. Transformations into yeast are typically carried out according 
to the method of Van Soiingen et at... J.Bact., ±30:946 ( 1 977) and Hsiao et ai. . . Proc. Natl. Acad. Sci. (USA). 
76:3829 ( 1 979). However, other methods for introducing DN A into cells, such as by nuclear microinjection, 
electroporation, bacterial protoplast fusion with intact cells, or polycations. e. g.. polybrene. polyornithine,may 
also be used. For various techniques for transforming mammaiian cells, see Keown et a/... Methods in 
Enzvmologv. 185:527-537 (1990) and Mansour et ai... Nature . 336:348-352 (1988). 

Suitable host cells for the expression of heterologous polypeptide that retain one or more 
glycosylation sites are derived from multicellular organisms. Examples of invertebrate cells include insect 
ceils such as Drosophiia S2 and Spodoptera Sf9. as well as plant cells. Examples of userui mammalian host 
ceii lines include Chinese hamster ovary (CHO) and COS cells. More specific examples include monkey 
kidney C V 1 line transformed by S V40 (COS-7. ATCC CRL 1651); human embryonic kidney line (293 or 293 
ceils subcioned for growth in suspension culture. Graham et ai.., J. Gen Virol.. 36:59(1 977)): Chinese hamster 
ovary cells/-DHFR (CHO, Uriaub and Chasin, Proc. Natl. Acad. Sci. USA. 77:4216 ( 1980)); mouse Sertoli 
cells (TM4. Mather, Biol. Reprod.. 23:243-251 (1980)); human lung cells (W138. ATCC CCL 75); human 
liver ceils (Hep G2. HB 8065); and mouse mammary tumor (MMT 060562, ATCC CCL51). The selection 
of the appropriate host ceii is deemed to be within the skill in the art. 

Host cells containing DNA constructs of the present invention are then cultured to produce the 
heterologous protein. The cells are cultured according to standard methods in a culture medium containing 
the necessary nutrients for growth of cultured mammalian cells. 



-15- 



SUBST1TUTE SHEET (RULE 26) 



WO 99/53059 



PCT/US99/081I0 



For the culture of the eukaryotic cells expressing the desired protein and modified as described for 
the instant invention, numerous culture conditions can be used paying particular attention to the host cell being 
cultured. Suitable culture conditions for eukaryotic ceils are well known in the an (J. Immunol. Methods 
0983)56:221-234) or can be easily determined by the skilled artisan ( see. for example. Animal Cell Culture: 
A Practical Approach 2nd Ed., Rickwood, D. and Hames, B.D.. eds. Oxford University Press, New York 
(1992)), and vary according to the particular host cell selected. 

The eukaryotic cell culture of the present invention is prepared in a medium suitable for the particular 
cell being cultured. Commercially available media such as Ham's F10 (Sigma). Minimal Essential Medium 
([MEM]. Sigma i. RPMI-1640 (Sigmaj, and Duibecco's Modified Eagle's Medium ([DMEM]. Sigma) are 
exemplary nutrient solutions. In addition, any of the media described in Ham and Wallace,* 1 979) Meth. Enz., 
58:44: Barnes and Sato.( 1980) Anal. Biochem.. 102:255: U.S. Patent Nos. 4.767.704; 4.657.866: 4.927,762; 
5. 122.469 or 4.560.655: International Publication Nos. WO 90/03430: and WO 87/00 1 95; the disclosures of 
all of which are incorporated herein by reference, may be used as culture media. Any of these media may be 
supplemented as necessary with hormones and/or other growth factors (such as insulin, transferrin, or 
epidermal growth factor), salts (such as sodium chloride, calcium, magnesium, and phosphate ». buffers (such 
as HEPES). nucleosides (such as adenosine and thymidine;, antibiotics (such as Gentamycin™ drug), trace 
elements (defined as inorganic compounds usually present at final concentrations in the micromolar range) 
lipids (such as linoleic or other fatty acids) and their suitable carriers, and glucose or an equivalent energy 
source. Any other necessary supplements may also be included at appropriate concentrations that would be 
known to those skilled in the an. 

In a particular embodiment, the mammalian host cell is a CHO cell, preferably a DHFR-CHO cell 
and a suitable medium contains a basal medium component such as a DM EM/HAM F- 12 based formulation 
(for composition of DMEM and HAM F12 media, see culture media formulations in American Type Culture 
Collection Catalogue of Cell Lines and Hybridomas. Sixth Edition. 1 988. pages 346-349) (the formulation of 
medium as described in U.S. Patent 5.122.469 are particularly appropriate) with modified concentrations of 
some components such as amino acids, salts, sugar, and vitamins, and optionally containing glycine, 
hypoxanthine, and thymidine; recombinant human insulin, hydrolyzed peptone, such as Primatone HS or 
Primatone RL (Sheffield. England), or the equivalent; a cell protective agent, such as Piuronic F68 or the 
equivalent piuronic polyol; Gentamycin: and trace elements. 

For the production of the sought after glycoproteins, production by growing the host cells of the 
present invention under a variety of cell culture conditions is typical. For instance, cell culture procedures for 
the large or small scale production of proteins are potentially useful within the context of the present invention. 
Procedures including, but not limited to, a fluidized bed bioreactor. hollow fiber bioreactor. roller bottle 
culture, or stirred tank bioreactor system may be used, in the later two systems, with or without microcarriers. 
and operated alternatively in a batch, fed-batch, or continuous mode. 

Following the polypeptide production phase, the polypeptide of interest is recovered from the culture 
medium using techniques which are well established in the an. 
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The polypeptide of interest preferably is recovered rrom the culture medium as a secreted 
polypeptide. As a first step, the culture medium or (ysate is centrifuged to remove paniculate cell debris. The 
polypeptide thereafter is purified from contaminant soluble proteins and polypeptides, with the following 
procedures being exemplary of suitable purification procedures: by fractionation on immunoaffmity or ion- 
5 exchange columns: ethanol precipitation: reverse phase HPLC: chromatography on siiica or on a cation- 
exchange resin such as DEAE: chromatofocusing; SDS-PAGE: ammonium sulfate precipitation: gel filtration 
using, for example. Sephadex G-75; and protein A SEPHAROSE™ columns to remove contaminants such 
as IgG. A protease inhibitor such as phenyl methyl suifonyl fluoride (PMSF) also may be useful to inhibit 
proteolytic degradation during purification. 
10 The following examples are offered by way of illustration and not by way of limitation. The 

disclosures of all citations in the specification are expressly incorporated herein bv reference. 

EXAMPLES 

Introduction 

The following are general methods used in the Examples that follow 

!5 Methods 

DNA and Constructions. The chimeric gene encoding TNFR-IgG I is described (Ashkenazi et ai. 
(1990 Proc. Natl. Acad. Sci. 88: 10535-10539).Site directed mutagenesis was carried out following the 
mismatched primer method of Kunkel. eta/. (1985) Proc. Natl, Acad. Sci. 82:488-492) using the Muta-Gene 
Ml 3 Kit(Bio-Rad Inc.. Hercules. CA). Mutations were verified by dye terminator cycle sequencing using the 
automated ABI 310 Genetic Analyzer (Applied Biosystems. Foster City. CA). For simplicity, the 
giycosy lation mutants described in this paper are named using the single letter amino acid code with reference 
to the amino acid residue present at each of the four N-linked glycosyiation sites at amino acid positions 14, 
105. 1 1 1. and 248 (Figure 1). Thus, the designation NNNN represents the fully glycosylated TNFR-IgGl, 
whereas NQNN indicates TNFR-IgG I where glutamine replaced asparagine at the second glycosyiation site 

25 (position 105). TNFR-IgGl glycosyiation variants were cloned into the expression vector pRK5 (Eaton et 
ai.. ( 1 986) Biochem. 29 1 :8343-8347), that enabled transient expression in the human embryonic kidney 293 
cell line. The pRK5 chimeric transcription unit includes a cytomegalovirus (CMV) immediate early promoter 
and a simian virus 40 (S V-40) polyadenylation site. For expression in CHO cells, gene encoding TNFR-IgG 1 
variants were cloned into the closely related expression vector pSVI6B5 that differs from pRK5 in that 

30 transcription is directed by SV-40 promoter and enhancer elements rather than by the CMV promoter. 

Cells and Transfectwns. 293 cells were cultured in 1 00 mm culture plates containing a mixture of 
Dulbecco s modified Eagles medium (DMEM) and Ham's F12 medium (F12) (Gibco BRL. Grand Island. 
NY) supplemented with 10% whole fetal bovine serum (FBS) and incubated at 37°C in an atmosphere 
containing 5% C0 2 . Confluent plates of 293 cells were passaged at a ratio of 1:5; and were rransfected at 60% 

35 confluency with plasmids containing TNFR-IgG 1 glycosyiation variants by the calcium phosphate method 
(Graham and Van Der Eb (1973) Virol. 52:456-467). For expression of TNFR-IgGl variants in CHO cell 
lines, the genes encoding TNFR-igGl glycosyiation site mutants were cloned into the pSVI6B5 expression 
vector. These plasmids were then co-transfected into CHO cells deficient in the production of dihydro folate 
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reductase (dhfr) alone with a plasmid (pFDl 1 ) containing a cDNA encoding murine dhfr. The transfected 
cells were selected for the ability to grow in culture medium deficient in the production of glycine, 
hypoxanthine and thymidine. Resulting colonies were picked and then selected for growth in varying 
concentration of methotrexate. Stable cell lines expressing TNFR-IgGl glycosylation site mutants were 
5 grown in 10 liter mini-fermentors. and growth conditioned cell culture medium was nan est; i for affinity 
purification (described below). 

Metabolic Labeling and Immunoprecipitation. Two days post-transfection. the culture medium was 
removed from the transfected 293 ceils and the ceil monolayer washed twice with phosphate buffered saline 
(PBS). Cells were incubated in methionine and cysteine free DMEM and supplemented with ProMix 35S- 

10 Cell Labelling Mix (Amersham. Arlington Hts, 1L) (100(Ci/mL). The ceils were labeled 8-16 hours at 
37 °C in an atmosphere of 5% C02. After labeling, the cell supernatants were removed, centrifuged at 4000 
rpm for 5 minutes, and 1 mL was aliquoted for immunoprecipitation experiments. The labeled cells were 
washed three times with PBS. iysed directly on the culture dishes with cell lysis buffer (PBS containing 3% 
NP-401 and centrifuged at 14.000 x g for 5 minutes. The lysate was transferred to a fresh tube, and 200 uL 

15 was removed for immunoprecipitation. Immunoprecipitation of both supernatants and lysates was 
accomplished by addition of 30 uL of S. aureus Protein A (Pharmacia Inc., Piscataway, NJV After a short 
incubation period on ice (10-15 minutes) the Protein A/TNFR-igGl complexes were sedimented by 
centrifiigation at 14.000 x g for I minute, washed in wash buffer (PBS. 1% NP-40. 0.1% sodium dodecyl 
sulfate (SDS)), and resuspended in 2X sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS- 
20 PAGE) sample buffer containing 5% 2-mercaptoethanol (2-ME). The immunoprecipitated proteins were 
resolved by SDS-PAGE using i 0%Tris-GIycine polyacrylamide gels (Novex. Inc.. San Diego. C A). Proteins 
were visualized by autoradiography and mobilities were calculated with reference to !4 C-MethylatedRainbow 

Colored Protein Molecular Weight Markers ( Amersham. Arlington Heights. iL). Proteins were quantitated 
by densitomerric scanning of autoradiographs using the Personal Densitometer (Molecular Dynamics. 
25 Sunnyvale, CA). 

Pulse Chase Analysis. 48 hours post-transfection. cells were pulse-labeled with ProMix j5 S-Cell 
Labeling Mix (500(Ci /mL) for 1 5 minutes. The labeling medium was then removed and the ceil monolayer 
was washed at room temperature with PBS. Fresh medium (DMEM/F12 * 10% FBS) was then added, and 
cells were harvested at various time-points. For harvesting, plates were chilled on ice: supernatants were 

30 removed, and the cell monolayer was washed once with PBS. Cells were then Iysed with ImL of lysis buffer. 
Lysate and supernatant samples were aliquoted and immunoprecipitated as described above. 

Purification of TNFR-IgG glycosylation site mutants. Growth conditioned culture medium from 
cultures at 10 L CHO cells expressing TNFR-IgGl glycosylation site mutations were purified by affinity 
chromatography on immobilized S. aureus protein A using a method described previously (Chamow et ai. 

35 (1994) J. Immunol. 153:4268-4280). Four TNFR-IgGl variants were purified: NNNN. NNQQ, NSNQ, and 
QSNQ. Briefly, purification was involved the following steps. (I) Prior to sample loading, the Protein A 
column was equilibrated with 20 mM Tris buffer. 15 mM NaCl (pH 7.4). After the growth conditioned cell 
culture medium was loaded onto the column, the column was sequentially washed with the following buffers: 
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20 mM Iris buffer. 150 mM NaCl fpH 7.4): 20 mM Tris buffer. 500 mM TMAC (pH 7.4): and 20 mM Tris 
buffer(pH 7.4). The giycosyiation site mutants were eluted with 50 mM citric acid, 20°/o(w/v) glycerol (pH 
3 .0) and the elution pools were subsequently adjusted to pH 6.0 using 1 .0 M sodium citrate. Finally, the eiuates 
were buffer exchanged by gel filtration chromatography into PBS (pH 7.4). the punned mutants were 
5 analyzed using SDS-PAGE ( 1 2.5%) with Coomassie blue staining. 

TNF binding assay. The binding of TNF to TNFR-IgG I and TNFR-IgGl giycosyiation site mutants 
was determined using a competitive binding immunoassay similar to that described by Ashkenazi. et aL 
{ 1991 ) supra . Briefly, microliter plates were coated with affinity purified goat antibodies to human igG Fc 
domain. Purified TNFR-IgG 1 or TNFR-IgG 1 giycosyiation site mutants were captured on the plates and then 
10 reacted with varying concentrations of unlabeled TNF (0.8 to 800 nM) and a fixed amount of ( 125 I)-labeied 
TNF (0.05 nM). After washing the amount of TNF remaining bound to the weiis of the microliter plates was 
determined with a gamma counter. Data were fit using a four parameter curve. The effective concentration 
of unlabeled TNF that resulted in half maximal (50%) binding of ( 125 !)-labeled TNF to TNFR-IgG 1 
giycosyiation mutants was reported as EC< 0 values. 
15 EXAMPLE 1 

TNFR-IgG 1 is a chimeric protein constructed by fusing the extracellular domain of the receptor for 
TNF alpha with sequences encoding the Fc domain and hinge region of IgG I ( Ashkenazi et a/... ( 1 99 1 ) supra). 
The chimeric protein (Figure 1 ) contains four N-Iinked giycosyiation sites: three located in the TNFR region 
(positions 14. 105, and III) and one in the Fc domain (position 248). Previous studies have shown that 
20 TNFR-IgGl is secreted as a homodimer. binds to TNF alpha with high affinity, and has potent anti- 
inflammatory activity in vivo. 

Pulse-chase experiments were carried out to characterize the secretion efficiency of TNFR-IgG I . In 
these studies, 293 cells were transiently transfected with a calcium phosphate precipitated piasmid 
(pRK. TNFR-IgG 1 ) and cultured for 2 days. The ceils were puise-iabeled with (~°S )- methionine, and samples 
25 were collected at various time points. Immunoprecipttation studies (Figure 2) indicated that TNFR-IgG I first 
appeared in the cell culture medium approximately 1 hr after pulse labeling, and that the amount of secreted 
protein progressively increased over the entire 24 hr observation period. These studies also showed that a 
significant fraction of the TNFR-IgGl synthesized during a 15 minute pulse labeling was retained inside the 
cells. Densitometric analysis of these data (Figure 3 A) demonstrated that only about 50% of the pulse-labeled 
30 protein was secreted from the cell during a 24 hr period, and that labeled precursor appeared to remain in a 
stable intracellular pool for more than 24 hr. The kinetics and pattern of secretion observed for TNFR-IgG 1 
were similar to those previously reported for two other secreted variants of membrane glycoproteins (Berman 
etai., (1989) J. Virol. 63: 3489-3498) suggesting that secretion of TNFR-IgGl was similarly inefficient. 

Endoglycosidase digestion and immunofluorescence studies similar to those described by Machamer 
35 and Rose ( 1 988) J. Biol. Chem. 263:5955-5960) showed that the intracellular TNFR-IgG 1 could be localized 
to the ER and was sensitive to endoglycosidase H digestion (endoH). These studies further showed that 
secreted TNFR-IgGl was resistant to endo H digestion and sensitive to neuraminidase digestion. Together 
these results suggested that the intracellular TNFR-IgG I accumulated in the endoplasmic reticulum (ER) and 
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possessed the characteristic high mannose form of N- linked carbohydrate, whereas the secreted protein 
possessed terminal siaiation acquired in the trans-Golgi apparatus (tGA). 

EXAMPLE 2 

deletion or giycosyiaiion siics can impair secretion of TNFR-i^Gi. Tu examine iiic ciTcU of 
5 sequential deletion of glycosyiation sites on the secretion of TNFR-IgG K further experiments were carried 
out with a series of glycosyiation mutants (Table 1). 

Table 1 

Summary of TNFRl-lgGl Glvcosvlation Mutants 
Amino acid position Secretion Efficiency 
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In these studies, mutagenesis primers were designed to replace the codon specifying asparagine (N) in the N- 
linked carbohydrate attachment seqon, N-X-S/T. with codons specifying giutamine (Q). Because 
glycosyiation sites 2 and 3 in TNFR-IgG I are only a few amino acids apart (residues 1 05 and 1 1 1). a single 
35 mutageneis primer could be used to mutate both sites simultaneously. We found that all of the glycosyiation 
mutants were expressed, but there were significant differences in secretion efficiency (Figure 4 A). Removal 
of the fourth glycosyiation site (residue 248). located in the Fc domain of TNFR-IgG 1 (NNNQ) had no effect 
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on secretion, relative to wild type TNFR-IgG I (NNNN). In contrast, removal of giycosylation sites 2 and 3 
(NQQN) resulted in a nearly complete blockade of secretion. Deletion of the first giycosylation site (QNNN) 
also blocked secretion, however in this case secretion was approximately 35% of that seen with the fully 
giycosyiatea protein. Removal or tne first tnree giycosylation sites (WWN) completely inhibited secretion 
5 as did removal of all four giycosylation sites (QQQQ). 

Further mutants were constructed (Table 1 ) to study, independently, the influence of giycosylation 
sites 2 and 3 on secretion. It was found that removal of giycosylation site 3 alone (NNQN) allowed for 
secretion at approximately 10% of level seen with the fully glycosylated protein, whereas deletion of 
giycosylation site 2 alone (NQNN) completely abolished secretion (Figure4B). Deletion of giycosylation sites 
10 3 and 4 (NNQQ) resulted in a secretion efficiency (Figure 4C) that was similar to the NNQN mutant but lower 
than the NNNQ mutant (Figure 4B). Deletion of giycosylation sites 1 and 4 (QNNQ) resulted in significantly 
lower levels of secretion (Figure 4C) than was observed with the QNNN and NNNQ mutants alone (Figure 
4A). 

EXAMPLE 3 

1 5 Nature of the amino acid side chain used to mutate N linked giycosylation site can effect secretion 

efficiency. Additional studies were carried out to determine if the nature of the amino acid side chain used 
for replacement of the N residue in the sequence creating an N-linked giycosylation site affected secretion 
efficiency. Each N-linked oligosaccharide chain can possess two to four sialic acid residues, depending on 
the antennarity of the high mannose precursor. We reasoned that replacement of N at N-linked giycosylation 

20 sites with charged residues (e.g. aspartic acid, glutamic acid, lysine, arginine) might have less effect on 
conformation than replacement with hydrophilic uncharged side-chains (e.g. glutamine. threonine, serine). 
We found that replacement of the N residues at giycosylation sites 2 and 3 with acidic, aspartic acid residues 
(NDDN) or basic, lysine residues (NKKN) allowed for better secretion (Figure 4B) than the uncharged 
glutamine residues in the (NQQN) construct (Figure 4A ); however, the amount of protein secreted was far less 

25 than that observed with the wild-type TNFR-IgG 1 protein. Replacement of asparagine at sites 2 and 3 with 
arginine (NRRN). also gave better secretion than the NQQN mutant, but the total amount of secreted protein 
appeared to be less compared to the NDDN or NKKN mutants (Figure 4D). 

We next considered the effect of uncharged, hydrophilic side-chains, serine and threonine, on the 
secretion efficiency of TNFR-IgG 1. We found that replacement of giycosylation sites 2 and 3 with serine 

30 (NSSN) or threonine (NTTN). like the glutamine mutants (NQQN) completely inhibited secretion(Figure4D). 
However, single substitutions of serine for asparagine at giycosylation sites 2 and 3 with serine gave different 
results (Figure 4D). The replacement of serine for asparagine at giycosylation site 2 (NSNN) allowed for 
better secretion than any of the other amino acid replacements examined at this position. In contrast, 
replacement of serine for asparagine at giycosylation site 3 (NNSN) resulted in secretion similar to that 

35 observed in the NNQN mutant (Figure 4B). 
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EXAMPLE 4 

Triple deletion mutants. To investigate the effect of glycosylation site 4 in combination with sites 
1-3. a series of mutants were constructed where three of the four giycosyiation sites were deleted (Table I). 
It was found that the NDDQ, the NfCKQ, the NDDN and NKKN mutants were ail secreted from transfected 
5 cells (Figure 4C). However, in ail four cases the amount of protein secreted was a small fraction 
(approximately 10%) of the amount secreted by fully glycosylated TNFR-IgGl. Thus, removal of 
giycosyiation site 4 did not alter the secretion efficiency of variants where asparagine residues at glycosylation 
sites 2 and 3 were replaced by lysine or aspartic acid. A final mutant where three of the four giycosyiation sites 
were deleted (QSNQ) was also examined (Table K Figure 5). This variant accumulated intracelluiarly and 
1 0 was not secreted. 

EXAMPLE 5 

Secretion efficiency of immunoadhesins can be improved by altering the signal sequence. Studies 
were performed to demonstrate that the yield and secretion efficiency of a representative immunoadhesin. 
TNFR-IgGl. from stably transfected CHO ceils can be improved by replacement of the TNFR-IeGl sienal 

15 sequence with that of human tissue plasminogen activator (tPA). In these studies the signal sequences of 
herpes simplex virus type 1 glycoprotein D(HSV gD-I ).Met-Gly-Gly-Thr-Aia-Ala-Arg-Leu-Gly-Ala-Vai-IIe- 
Leu-Phe-Vai-Val-Ile-Val-Gly-Leu-His-Gly-Val-Arg-Gly (SEQ ID NO: 9). recombinant human DNase, 
Met-Arg-Gly-Lys-Leu-Leu-Gly-Ala-Leu-Leu-^ 
(SEQIDNO: 10), HER-2.Met-Gly-Trp-Ser-Cys-I^ 

20 (SEQ ID NO: 1 1 ), TNFR, Met-Gly-Leu-Ser-Thr- Vai-Pro-Asp-Leu-Leu-Leu-Pro-Leu- Val-Leu-Leu-GIu-Leu- 
Leu-Val-Gly-Ile-Tyr-Pro-Ser-Gly-Val-Ile-Gly (SEQ ID NO: 2) (Ashkenazi era/.. (1991) Proc. Natl. Acad. 
Sci. 88: 10535- 1 0539), and t-PA (SEQ ID NO: 1 ) were operably linked to the nucleic acid sequence encoding 
TNFR 1 -IgG 1 and the secretion kinetics were followed. The results indicated an increase in amount of TNFR- 
IgG 1 that could be recovered from the supernatant as well as an improvement in the secretion kinetics (2 hrs 

25 and 24 hrs). 

To determine whether the tPA signal/pro-sequence improved yield by increasing the efficiency of 
intracellular transport, a plasmid (pRK.tPA-TNFR-[gG I ) was constructed for transient rransfection studies in 
293 ceils. Pulse-chase studies (Figure 2, Figure 3 A and B) showed that replacement of the TNFR signal 
sequence with the tPA signal/pro sequence resulted in a significant improvement in secretion efficiency. Thus. 

30 after 24 hours, approximately 70-80% of the pulse-labeled TNFR-IgGl was secreted into the cell culture 
medium when the tPA signal/pro sequence was included, whereas only 50-60% of the protein was secreted 
using the wild-type TNFR signal sequence. The increased efficiency of secretion achieved with the tPA 
signal/pro sequence was apparent from the kinetics of secretion of TNFR-IgGl (Figures 2. Figures 3 A and 
B). When the TNF signal/pro sequence was used, only 40% of the pulse-labeled TNFR-IgGl was secreted in 

35 2-4 hr after pulse labeling. However when the TNF signal sequence was replace with the tPA signal/pro 
sequence approximately 60% of the TNFR-IgG synthesized with the TNFR signal sequence was secreted in 
this time frame. 
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Studies of the biosynthesis of tPA have shown that the N terminal processing of tPA is complex and 
involves co-rranslational cleavage of a 21 amino acid signal sequence, post-translation cleavage of an 1 1 
amino acid pro-sequence at a rurin cleavage site, and extracellular cleavage of a 3 amino acid. N-terminal 
peptide by an undefined exopeptidase (Figure 1 ). This complex strategy for N terminal processing of TNFR- 
5 IgG I mutants is supported by our results showing that intracellular TNFR-IgG 1 variants possessing the tPA 
signal/pro sequence possessed higher molecular weights than the corresponding secreted proteins (Figure 5 A). 
Since nirin is known to be localized in the trans-GA. the pro-sequence would not be expected to be removed 
until the tPA-TNFR precursor has been transported from the ER to the trans-GA. Endoglycosidase digestion 
studies and immunofluorescence studies have provided data consistent with an ER localization of intracellular 
10 variants of tPA.TNFR-IgGl . 

EXAMPLE 6 

Secretion efficiency of glycosylation site mutants can be improved by altering the signal sequence. 
To determine whether signal sequence exchange could improve the secretion efficiency of the TNFR-IgG 1 
glycosylation site mutants described in Figure 4. the TNFR signal sequence was deleted and replaced by the 
15 tPA signal/pro sequence. It was found (Table 2. Figure 5) that the signal sequence exchange resulted in a 
marked increase in secretion efficiency of these glycosylation site mutants. 

Table 2 

Effect of tPA Signal Sequence or Secretion of TNFR I -IgG 1 Glvcosviation Mutants 

% Secretion Efficiency* EC50 for 

20 TNF Binding ! 



Mutant 


TNFR.ss 


tPA.ss 


(nM) 




NNNN 


50 


70 
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0.73 


NNQQ 


20 
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4.74 -r/- 


0.48 


N SN 0 
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6.94 


0.83 


NKKQ 


10 


40 


ND 




QSNQ 


<5 


65 


2.34 


0.22 


QQQQ 


<5 


<5 


ND 





* Data represent results from pulse-chase experiments (e.g. Figure 3) where the percentage of pulse-labeled 
30 protein secreted in a 24 hr period was measured by scanning densitometry. 

JData represent EC50 values for the displacement of ( l25 I)-labeled TNF by TNFR-IgG 1 glycosylation site 
mutants (Figure 7). 

For example, only 10-20 % of the NNQQ mutant was secreted using the TNFR signal sequence, 
whereas 60-70% of the NNQQ mutant containing the tPA signal/pro sequence was secreted (Figures 3C and 
35 D, Figure 5). Similarly, little or none of the QSNQ mutant was secreted containing the TNFR signal sequence, 
but approximately 60% was secreted from the tPA signal/pro sequence containing variant (Figures 3E and F. 
Figure 5V Similar improvements in secretion efficiency were observed for the NKKQ and NSNQ mutants 
(Figure 5). Densitometric analysis of pulse chase experiments (Figure 3) showed that attachment of the tPA 
signal/pro sequence accelerated the kinetics of intracellular transport as well as increasing the total amount 
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of secreted protein. Although replacement of the tPA /pro sequence could overcome the blockade of protein 
secretion for many of the giycosyiation mutants, this strategy was not effective for the fully glycosylated 
OQQQ giycosyiation site mutant (Figure 5). 

EXAMPLE 7 

5 Structure and TNF binding activity of TNFR-IgG 1 giycosyiation mutants. Although the signal 

sequence exchange strategy described above provided a method that allowed for the secretion of TNFR-IgG 1 
giycosyiation site mutants, we wondered whether the secreted proteins were properly folded. To answer this 
question, stable CHO cell lines, expressing several of the TNFR-IgG I giycosyiation mutants, were constructed 
in order to produce sufficient quantities for iigand binding studies. Recombinant proteins representing the 

1 0 NNQQ, NSNQ. and QSNQ mutants were purified from growth conditioned cell culture medium using protein 
A affinity chromatography. It was found (Figure 6) that the giycosyiation site mutants, like TNFR-IgG 1. 
associated to form covalent disulfide bonded dimers that were stable in SDS and could be dissociated by the 
addition of reducing agents (e.g. 2-mercaptoethanol). As expected, the molecular mass of the giycosyiation 
variants correlated with the number of intact giycosyiation sites. Thus the variants with two giycosyiation sites 

! 5 deleted ( NNQQ and NSNQ) were smaller in molecular weight than the wild type (NNNN) proteins and larger 
in molecular mass than the QSNQ variant the possessed only one giycosyiation site. 

TNF binding studies were carried out to determine whether the secreted giycosyiation variants were 
folded into a functionally relevant conformation. For this purpose, the three giycosyiation mutants illustrated 
in Figure 6 were evaluated in a binding assay (Figure 7) using (' 25 H-labeied TNF by a method similar to that 

20 described by Ashkenazi et ai. All three mutants bound the ( I25 I)-labeIed TNF with EC 50 values (Table 2) 
comparable to that observed for fully glycosylated TNFR-IgG 1. These studies suggested that neither 
giycosyiation sites nor the tPA signal/pro sequence affected the interaction of TNFR-IgG 1 w ith it's ligand. 

EXAMPLE 8 

To determinethe influence of the tPA propeptide on secretion kinetics of tPA/TNFR- 1 gG 1 a construct 
25 was generated containing the 29aa TNFR signal sequence (SEQ ID NO: 2) and a 13 amino acid t-PA 
propeptide (SEQ ID NO: 7) having the sequence Met-GIy-Leu-Ser-Thr- Val-Pro-Asp-Leu-Leu-Leu-Pro-Leu- 
Val-Leu-Leu-Glu-Leu-Leu-Val-Gly-I^ 

Arg-Giy-Ala-Arg(SEQ ID NO: 8) was prepared. From RIP experiments SEQ ID NO: 8 was shown to have 
similar t and percent secretion at 24 hours to that of tPA/TNFR-IgGl (Figure 8 A-C). Endo H digest 

30 also reveal similar secretion kinetics to that of tPA/TNFR-IgG I , where approximately I /3 of the iysate is endo 
H resistant between 1 0 and 30 minutes and is probably secreted within 30 minutes, as opposed to the wild type 
TNFr-IgG in which very little of the lysate is Endo H resistant and the band does not decrease overtime. This 
evidence strongly indicates that the decrease in t l/2max and percent secretion at 24 hours for tPA/TNFR-IgG I 
is due to the presence of the 1 3 amino acid propeptide. This is supported by transient expression data, wherein 

35 14 yields similar total expression as well as specific productivity. 
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WHAT IS CLAIMED IS: 

I. A DNA construct comprising; 

a tirst DNA segment encoding a precursor polypeptide comprising a pro-sequence of a mammalian 
t-PA; and 

a second DNA segment operably linked to the first DNA sequence, the second DNA sequence 
encoding a heterologous glycoprotein. 

2. The DNA construct of claim 1 wherein the heterologous glycoprotein is an immunoadhesin. 

3. The DNA construct of claim 2 wherein the immunoadhesin is a TNF receptor immunoadhesin. 

4. The DNA construct of claim 3 wherein the TNF receptor immunoadhesin is TNFRl-IeGl. 

5. The DNA construct of claim 1 wherein the DNA encoding the mammalian t-PA pro-sequence is 
operably linked to a pre-sequence other than a mammalian t-PA pre-sequence. 

6. The DNA construct of claim 5 wherein the heterologous glycoprotein is and immunoadhesin. 

7. The DNA construct of claim 6 wherein the immunoadhesin is a TNF receptor immunoadhesin. 

8. The DNA construct of claim 7 wherein the TNF receptor immunoadhesin is TNFR 1 -IgG 1 . 

9. The DNA construct of claim 5 wherein the mammalian t-PA pro-sequence is operably linked a pre- 
sequence associated with the native heterologous polypeptide. 

10. The DNA construct of claim 9 wherein the heterologous glycoprotein is a TNF receptor 
immunoadhesin and the pre-sequence is a pre-sequence of a mammalian TNF receptor. 

11. The DNA construct of claim 10 wherein the mammalian t-PA pro-sequence is SEQ ID NO: 7. 

12. The DNA construct of claim 1 1 wherein the pre-sequence is SEQ ID NO: 8. 

13. The DNA construct of claim 12 wherein the TNF receptor immunoadhesin is TNFRl-IgGl. 

14. A DNA construct comprising: 

a first DNA segment encoding a precursor peptide; and 
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a second DNA segment operably linked to the first DNA sequence, the second DNA sequence 
encoding a heterologous glycosylation site variant glycoprotein. 

15. The DNA construct of claim 14 further comprising one or more additional DNA segments operably 
linked to the first and second DNA segments. 

16. The DNA construct of claim 14 wherein the precursor peptide comprises the pro-sequence of a 
mammalian t-PA. 

17. The DNA construct of claim 16 wherein the pro-sequence is a human t-PA pro-sequence. 

18. The DNA construct of claim 17 wherein the pro-sequence is SEQ [D NO: 5. 

19. The DNA construct of claim 16 further comprising a pre-sequence of a mammalian t-PA. 

20. The DNA construct of claim 19 wherein the pre-sequence is a human t-PA pre-sequence. 

21. The DNA construct of claim 20 wherein the pre-sequence is SEQ ID NO: 3. 

22. The DNA construct of claim 21 wherein the precursor peptide is SEQ ID NO 1. 

23. The DNA construct of claim 1 4 wherein the heterologous glycosylation site variant is a glycosylation 
site addition variant. 

24. The DNA construct of claim 1 4 wherein the heterologous glycosylation site variant is a glycosylation 
site deletion variant. 

25. The DNA construct of claim 24 wherein the heterologous glycosylation site variant is an 
immunoadhesin. 

26. The DNA construct of claim 25 wherein the immunoadhesin is TNFR-lgG. 

27. The DNA construct of claim 26 wherein the TNFR-lgG is TNFR 1 -IgG 1 . 

28. The DNA construct of claim 27 wherein the TNFR 1 -IgG 1 has an N-linked glycosylation site selected 
from the group consisting of amino acid positions 14, 105, 1 1 1 and 248 deleted. 

29. The DNA construct of claim 28 wherein the TNFR 1 -IgG 1 has the N-linked site at 14 deleted. 
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30. A cultured eukaryotic host ceil comprising a DNA construct comprising:: 

a first DNA segment encoding a precursor peptide corresponding to a mammalian tissue plasminogen 
activator secretory peptide; and 

a second DNA segment operably linked to the first DNA sequence, the second DNA sequence 
encoding a heterologous glycosyiation site variant. 

31. The cultured eukaryotic host cell of claim 30 wherein the host cell is a rodent host cell. 

32. The cultured eukaryotic host cell of claim 3 1 which is a CHO cell. 

33. A method of producing a polypeptide which has been altered to delete one or more native N-linked 
glycosyiation sites comprising the steps of 

(a) culturing a eukaryotic host cell comprising a DNA construct comprising: 

a first DNA segment encoding a precursor peptide corresponding to a mammalian tissue plasminogen 
activator signal-pro peptide: and 

a second DNA segment operably linked to the first DNA sequence, the second DNA sequence 
encoding a heterologous glycosyiation site deletion variant polypeptide; 

wherein the eukaryotic host cell express the first and second DNA segments and the polypeptide is 
secreted from the cell; and 

(b) recovering the polypeptide so produced. 
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Sequence Listing 

<110> Genentech, Inc. 

<12 0> PROTEIN SECRETION 

<130> P1055R1PCT 

5 <141> 1999-04-14 

<150> US 60/082 , 002 
<151> 1998-04-16 

<150> US 60/ 
<151> 1999-03-08 

10 <160> 13 

<210> 1 
<211> 35 
<212> PRT 

<213> Homo sapiens 
15 <400> 1 

Met Asp Ala Met Leu Arg Gly Leu Cys Cys Val Leu Leu Leu Cys 
1 5 10 .15 

Gly Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe 
20 25 30 

20 Arg Arg Gly Ala Arg 

35 

<210> 2 
<211> 29 
<212> PRT 
25 <213> Homo sapiens 

<400> 2 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu 
1 5 io is 

Leu Glu Leu Leu Val Gly He Tyr Pro Ser Gly Val He Gly 
30 20 25 29 

<210> 3 

<211> 21 

<212> PRT 

<213> Homo sapiens 

35 <400> 3 

Met Asp Ala Met Leu Arg Gly Leu Cys Cys Val Leu Leu Leu Cys 
15 10 is 

Gly Ala Val Phe Val Ser 
20 21 
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<210> 4 
<211> 11 
<212> PRT 

<213> Homo sapiens 
5 <400> 4 

Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
1 5 10 11 

<210> 5 
<211> 14 
10 <212> PRT 

<213> Homo sapiens 

<400> 5 

Pro Ser Gin Glu He His Ala Arg Phe Arg Arg Gly Ala Arg 
15 10 14 

15 <210> 6 
<211> 32 
<212> PRT 

<213> Homo sapiens 



20 



<400> 6 

Met Asp Ala Met Leu Arg Gly Leu Cys Cys Val Leu Leu Leu Cys 
1 5 10 is 

Gly Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe 
20 25 30 

Arg Arg 
25 32 

<210> 7 
<211> 13 
<212> PRT 

<213> Homo sapiens 
30 <400> 7 

Ser Gin Glu He His Ala Arg Phe Arg Arg Gly Ala Arg 
1 5 10 13 

<210> 8 
<211> 42 
35 <212> PRT 

<213> Artificial 

<220> 

<223> Artificial sequence 
<400> 8 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu 
1 5 10 is 

Leu Glu Leu Leu Val Gly He Tyr Pro Ser Gly Val He Gly Ser 
20 25 30 



40 
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Gin Glu He His Ala Arg Phe Arg Arg Gly Ala Arg 
35 40 42 

<210> 9 
<211> 25 
5 <212> PRT 

<213> Herpesvirus 

<400> 9 

Met Gly Gly Thr Ala Ala Arg Leu Gly Ala Val He Leu Phe Val 
15 io 15 

10 Val He Val Gly Leu His Gly Val Arg Gly 

20 25 

<210> 10 
<211> 21 
<212> PRT 
15 <213> Homo sapiens 

<400> 10 

Met Arg Gly Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu 
1 5 10 15 

Leu Gin Gly Ala Val Ser 
20 20 21 

<210> 11 
<211> 19 
<212> PRT 

<213> Homo sapiens 
25 <400> 11 

Met Gly Trp Ser Cys He He Leu Phe Leu Val Ala Thr Ala Thr 
1 5 io 15 

Gly Val His Ser 
19 

30 <210> 12 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 12 

35 Met Asp Ala Met Leu Arg Gly Leu Cys Cys Val Leu Leu Leu Cys 
1 5 io 15 

Gly Ala Val Phe Val Ser Pro 
20 22 

<210> 13 
40 <211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 13 
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Ser Gin Glu He His Ala Arg Phe Arg Arg 
15 10 
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